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Abstract 

Wideband Code Division Multiple Access (WCDMA) is 
the dominant air interface for the Universal Mobile 
Telephone Service (UMTS) Frequency Domain Duplex 
(FDD) mode and for future IMTS wireless networks. The 
transmitted CDMA signals from Base Station(BS) 
propagate through noisy multipath fading communication 
channels before arriving at the receiver of the mobile 
station (MS). In contrast to classical single-user detection 
(SUD) algorithms, which do not provide the requisite 
performance for modern high data rate applications, 
optimal multi-user detection (MUD) approaches require 
a lot of a-priori information not available to the User 
Equipment (UE). We propose two adaptive approaches, 
namely RAKE-Blind Source Recovery (RAKE-BSR) and 
RAKE-Principal Component Analysis (RAKE-PCA) that 
fuse an info-theoretic stage into a standard RAKE 
receiver. This results in robust detection algorithms with 
performance exceeding the standard LMMSE detectors 
for WCDMA systems under conditions of congestion, 
imprecise channel estimation and unmodeled multiple 
access interference (MAI). 

1. Introduction 

Wideband CDMA (WCDMA) will be a dominant 
technology for third generation (3G) and future wireless 
communication systems and forms an integral part of the 
CDMA2000, 3rd Generation Partnership Project (3GPP) 
International Mobile Telecommunication Standard 
(IMTS-2000) [1-3]. In a CDMA system, the symbol train 
to a user may be detected using either a single-user or a 
multi-user detector. A single-user detector (SUD) such as 
Matched Filter (MF) detector, Zero Forcing (ZF) detector, 
RAKE [5, 10) 12] etc., does not model the multiple access 
interference (MAI) due to the presence of other users and 
extraneous signals in its signal path, rather it treats all 
interfering users and disturbances as additive noise. This 
severely limits its detection performance and SUD fails to 
provide the performance levels vital to modern high data 
rate applications [12], 

In contrast, a multi-user detector (MUD) includes all the 
users in the signal model. Significant improvement can be 



obtained with a multi-user receiver [5, 6, 9, 14, 15]. 
However the optimal MUD [12] is computationally 
intensive and requires several dynamic system parameters 
to be known precisely. Several other linear multi-user 
detection (MUD) techniques such as Best linear unbiased 
estimator (BLUE), linear minimum mean squared error 
(LMMSE) estimator [4, 8, 9, 1 5] have been proposed for 
the wireless downlink based on the linear convolutive 
channel model. In practical situations most of these 
estimators require extensive knowledge of channel 
parameters and massive computational power. One 
approach is to use sub-optimal block-level 
implementations, where performance degrades quickly if 
the a-priori channel estimates are far from true system 
parameters. Adaptive approximations of these algorithms 
also require some a-priori information for correct 
initialization and may not match the performance of the 
algebraic counterparts under all conditions. 

For the proposed 3GPP UMTS FDD [1, 3] and future 
tightly synchronous CDMA standards, the user specific 
spreading is done by Orthogonal Variable Spreading 
Factor (OVSF) codes, while the cell specific complex 
scrambling is done using a set of long gold sequences 
(and/or Kasami sequences) [3]. Long codes have their 
own advantages such as code-hopping which results in 
similar performance for all users in the system (i.e., better 
QoS). In addition long codes also result in improved 
power control for users with relatively smaller data rates 
in the system and better rejection of extra-cellular 
interference (i.e., lower effective BER) [6]. Contrarily, 
the use of long scrambling codes makes the 
implementation of the exact linear detection algorithms 
impractical due to the excessive computational 
requirements [8, 15]. 

In this paper we propose two adaptive approaches, 
suitable for the WCDMA downlink with long scrambling 
code, namely RAKE-Blind Source Recovery (RAKE- 
BSR) and RAKE-Principal Component Analysis (RAKE- 
PCA) that fuse an info-theoretic stage into a standard 
RAKE receiver. This results in robust detection 
algorithms with performance exceeding the standard 
LMMSE detectors for WCDMA systems under 
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conditions of congestion, imprecise channel estimation 
and unmodeled MAI. 

2. UMTS FDD Forward Link (or Downlink) 

Consider the forward Unkfor a modern wireless 3G 
UMTS FDD (best known as WCDMA) cellular mobile 
communications network [1-3]. User specific data is 
delivered to the physical layer in blocks often termed as 
Transport Blocks (TB) [2], which include bits for CRC, 
FEC etc. The transport channel multiplexer (or a parallel- 
to-serial converter PIS) converts all the data to a single 
BPSK stream. After interleaving and addition of physical 
channel information for power control (TPC) y pilot, TFCI 
etc., the data stream comprises of two dedicated logical 
channels, one for data (DPDCH) and one for 
corresponding control (DPCCH) information, in 
multiplexed form they form one user specific dedicated 
physical data channel {DPCH) [3]. However, note that in 
case several data streams are associated to a, single user, 
they can share the same logical control channel. Before 
transmission, the data goes through a serial-to-parallel 
{SIP) converter for QPSK modulation. 




Figure 1: Signal generation based on the proposed 
3GPP UMTS FDD standard 



Each user specific stream is first spread using a relatively 
short (4-512 chips) OVSF channelization code C ch (or 
real user-specific spreading code,^(/)). The parallel 
channels are then mapped to the I and the Q branches. At 
this stage the data streams for all the users in a cell are 
summed together. This combined QPSK stream is 
subsequently scrambled by a long (38400 chips) 
scrambling code C scra mb ( or cell-level complex gold code, 
c{t) ). The collective data for all users is then modulated 
and synchronously transmitted via the transmission 
medium, which we assume to be a wide sense stationary 
slowly fading multipath frequency selective channel [1 - 
3]. 

All physical channels transmitted from a BS are mutually 
orthogonal at the time of transmission as the spreading 
codes are the OVSF (or Walsh Hadamard codes), which 



are confined to the symbol period only. Different users 
can have different data-rates, which is accommodated by 
assigning different length OVSF spreading sequences 
s k (t) in a global prefix-free code assignment tree [3]. 
The downlink system is designed to be tightly 
synchronous at the frame level [1]* The intra-cell multiple 
user signals sharing the same multipath environment are 
considered to be the main cause of the Multiple Access 
Interference (MAI) in the received signal for a desired 
user [5, 12], 

2.1 WCDMA Downlink Signal Model 

The received signal at the mobile receiver within a 
transmitted frame of length T F can be represented by 

K0=LiZv^^^( w )^(^(^.^-^)+ "(00) 

n=\ fc=l M) 

Where is the energy of the symbol for the user, 
b k {n) € {±1 ± ;} is the n th symbol for the H k user, h } and 
r { are the /* path's gain co-efficients and delay, 
respectively. n(t) is the additive noise and s k {t) is the k? h 
user's signature code (or spreading sequence) generated 
by 

(7-1 

s k (t)=^a k (m)p(t-mT c ) (2) 

a k {m) e{-l,l}; 0<w<G-l is a real OVSF spreading 
sequence for the U h user containing G chips per symbol, 
i.e., G = T b jT c , p(t) is a chipping pulse of duration T c . 
T b being the symbol period. c(t)e{±\±i) is the 

complex cell-specific gold scrambling sequence [3]. 
Under the assumption of time-invariance, the model in (1) 
can be compactly written in a vector-matrix format as 

r=HCSb+n (3) 
Where, H is a (NG + L-\)xNG multipath propagation 
co-efficient matrix containing the channel coefficients. S 
is a NGxNG block diagonal matrix with the matrix of 

spreading codes forming the diagonal elements, b is an 
NG -d vector containing the user symbols, while n is the 
(M?+Z,-l)-d channel noise vector with covariance 
matrix Q* C is the NGxNG complex diagonal 
scrambling matrix. N being the number of symbols within 
a frame time 7>. The structure of the above defined 
matrices rind vectors is given by 

' 0 ». 0 * 
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b=[b(\) T b{2? KAO T f, 

C = <#ag[C] c 2 • c NG ]; c ( - e {±1 ± i) ; 1 £ 1 < NG 

The compact linear model (3) is useful in deriving 
the linear detectors such as matched filter (MF); linear 
minimum mean squared error (LMMSE) etc. (see, ; section 
3) for recovery of the transmitted symbol train for a 
desired user equipment (UE). However, the size of the 
above model is excessively large due to the dimension of 
the long scrambling code. A number of approaches will 
be discussed in the subsequent section to recover the 
desired user's symbol train at UE/MS while keeping in 
mind the constraints of performance and iease of 
implementation. * 

2.2 Channel Estimation Using The Common Pilot 
Channel (CPICH) 

Good Channel Estimation is vital to the performance of 
most conventional detection schemes. WCDMA includes 
provision of Common Pilot Channels (CPICH), which 
may be implemented for active, channel estimation. 
CPICH is a fixed rate (30 kbps, G=256) downlink 
physical channel that carries a pre-defined bit sequence 
[1]. If applicable, CPICH allows the UE/MS to equalize 
the channel in order to achieve a phase reference with the 
SCH {Synchronization Channel). The standard defines a 
fixed channelization code for the Primary CPICH. 

2.2. 1 Channel Estimation Procedure 

In order to outline a simple and practical channel 
estimation technique using CPICH [5], we assume that 
the transmitted pilot comprises of a stream of a single 
symbol train of 1 - i , and uses an all ones spreading code 
(i.e., the first OVSF code) 

1. For each independent multipath, multiply the 
incoming symbol train with the corresponding 
delayed scrambling sequence. 

2. Remove the modulation form CPICH by simply 
multiplying the CPICH data by its conjugate, i.e., 
1 + j\ The resulting channel estimate is noisy because 
of AWGN and multiple-access interference. 

3. Pass the noisy channel estimate through a smoothing 
filter to achieve better noise immunity. This filter can 
be either a moving average window of iength 2M+ 1 , 
e.g., 

M 

=— y p» (4) 



1 

= 2M + l 1 § / i 



a relatively smaller factor M followed by a single-zero 
smoothing filter 

h. = (l-p)fy + p/J,; where 0< p < 1 (5) 
4. Decimate or interpolate the filtered channel estimate 
obtained in step 2 to match the data rate of the CPICH 
to the data rate of the DPCCH/DPDCR 



Smoothing 




Data Rate 


Filter 




Matching 



Channel Estimate 



{CPICH)" 



Since, a moving UE/MS represents a dynamic 
channel, the channel estimates may be processed using 



Fig. 2. Channel Estimation Using CPICH 

This simple technique works well in most cases because 
the channel is assumed to be stable for the symbol 
duration. 

3. WCDMA Detection Structures 

In this section, we first briefly describe the linear 
detection schemes such as the matched filter (MF), 
multipath version of matched filter or RAKE receiver and 
the linear minimum mean squared error (LMMSE) 
detector [5, 12, 15]. Other linear detector formulations 
such as the least squares (LS) or the zero forcing (ZF) 
detector, best linear unbiased estimator (BLUE) detector 
are 'not discussed as their performance is worse as 
compared to the LMMSE detector, see [8, 10, 12, 15] for 
more details. 

3JA Matched Filter (MF) Detector 

The standard matched filter is a single user detector, 
which just utilizes the user's, own signature sequence to 
make the best possible estimate of user's transmitted 
sequence from the raw chip data received at the UE/MS. 
The detection algorithm completely ignores the presence 
of convolutions due to multipaths in the in the receiver 
environment as well as the MAI due to additional users 
sharing the resources. In case of WCDMA system, the 
user's symbol undergoes spreading as well as scrambling, 
therefore the MF detector becomes 

(«) = *?C H (»)r ((» -1)G + 1 : nG) (6) 
where C H («) = [diag [c (lM)CW • ■ • c nG ]}" 
3,2 RAKE Detector 

The RAKE detector, in CDMA is created using multiple 
chip-delayed MF detector fingers in parallel. The RAKE 
receiver first identifies three or more (depending on the 
number of implemented fingers) strongest multipath 
signals arriving at the receiving antenna using a maximal 
ratio path search algorithm [5, 12]. These multipath 
signals are combined after adjustment for their 
corresponding delays (and possibly phase and path 
attenuation if channel estimation is available) to produce 
a relatively stronger received signal chip stream. In this 
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work, we assume that the RAKE receiver to be 
implemented with the- knowledge ■. of . both channel 
multipath delays (maximal ratio path search) as well as 
the corresponding multipath attenuation co-efficients 
(channel estimation). The RAKE detector for WCDMA 
system is given by 

%Mke (») = s?C H {n)H H {n)r{(n -I)G + l: nG) (7) 
where, C H (n) is as defined above , and H H (n) is the 
hermitian of the estimated channel estimate filter. 

3.3 Linear Minimum Mean Squared Error 
(LMMSE) Detector. 

The optimal LMMSE detector is considered to be the best 
linear detector for DS-CDMA reception. While other 
linear detectors such as LS, ZF, BLUE etc. do not provide 
good performance in the presence of excessive noise 
(especially colored noise). LMMSE detectors do a trade- 
off by not performing perfect orthogonalization of 
received signal stream at low SNRs by trying to minimize 
additive noise variance. The LMMSE detector for the 
WCDMA system is given by 

KlMMSE = &?C H ii H {<J 2 M H +&r X r . (8) 

where 

S^diag^ S t S,],S,=[0 "< s, < ■ 0], j, 
being the f 1 user's signature code and 
R = E[?7 H ^ = (a 2 HH H +Q) is the auto-correlation 
matrix for the signal chip train received at the UE/MS, 
er 2 being the average power of transmitted user signals. 
Note that the dimension of H is impractical for the 
implementation purpose. Typically LMMSE is 
implemented on wG-sized blocks of received data and is 
called block LMMSE (B-LMMSE). LMMSE detector can 
typically be implemented by adding a pre-processing 
stage before RAKE receiver. The data correlation matrix 
may be adaptively estimated resulting in adaptive 
LMMSE implementations [4, 6, 15]. 

3.4 RAKE-Blind Source Recovery (RAKE-BSR) and 
RAKE-Principal Component Analysis (RAKE-PCA) 
Detectors 

RAKE-BSR and RAKE-PCA are two new proposed 
adaptive detectors [14], which utilize the same knowledge 
as a RAKE receiver. An info-theoretic adaptive weighting 
matrix of dimension GxG is introduced into the RAKE 
structure, which gives a big performance boost to the 
standard RAKE receiver. The performance of RAKE- 
BSR/RAKE-PCA exceeds the performance of LMMSE 
detectors under the conditions of high network 
congestion, imprecise channel estimation, and unmodeled 
inter-cellular interference etc [14]. The closed form 
structure of these proposed detectors is given by 



^iMKE-iCAlPCA = C H WH H 7 (9) 

where W~diag[A A ••• A], and A is the GxG 

matrix that is adaptively estimated either using static BSR 
(ICA) or PCA algorithms. 

We propose to adapt the matrix A using the natural 
gradient update laws [11, and the references therein]. 
However, there exist several other methods for ICA/PCA 
and any other suitable method may be used for these 
adaptations [7, and the references therein]. This blind 
adaptation of the A matrix has several advantages and 
improves the performance of the overall equalization 
process in several ways. Firstly, it can counter artifacts in 

the estimated channel co-efficients H . Secondly, the 
channel estimation process (as in RAKE receivers) may 
be limited by the structure (such as number of fingers) 
and may estimate only a few of the dominant channel 
parameters. W stage in (9) tends to counteract this 
anomaly, as best as possible, and provides better 
performance than LMMSE in such cases [14]. Thirdly, 
this adaptive stage minimizes the effect of the additive 
channel noise, which may have an unmodeled intricate 
underlying structure- Lastly, the natural gradient ICA 
/PCA algorithms inherently reduce near-far problems by 
removing any ill conditioning in the signal space for all 
the users in the system. This results in all the mobile users 
in the system to have a BER performance similar to the 
average BER performance of the downlink channel [6, 
14]. The matrix .4 is adaptively estimated using the update 
laws 

A{k + l)~A(k) + r} k bA(k) (10) 
where 

a.„x \(I-<p(y(k))y(kf)A(k) for static BSR/ICA 

[ (I-y(k)y(kf)A(k) : for PCA 

and tp(.)is a nonlinear score function [12, 13] which 
depends on the underlying distribution structure of the 
signals involved. For QPSK signal, a suitable score- 
function [13] is 
^O^-^-^tanh^ 

Of these proposed RAKE-BSR/RAKE-PCA structures, 
RAKE-BSR exhibits relatively faster and more stable 
convergence [12, 14]. However, in CDMA systems the 
underlying code structure is "orthogonal", and RAKE- 
PCA may converge to a slightly lower BER solution if the 
channel impairments are linear in nature. Note that in 
(RAKE-1CA), if the channel estimate H [5] is either not 
available or changes very dynamically, the detector can 
be estimated without using the channel estimate and the 
structure reduces to Matched Filter BSR/PCA, i.e., 
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The performance of this structure is better than MF alone, 
and approaches RAKE performance as the underlying 
matrix A converges. However, in this paper due to space 
limitation we will not discuss this structure any further. 

4. Simulation Results 

The proposed algorithms can be easily applied to 
multirate WCDMA transmission. However, for a clear 
comparison of the proposed algorithms with the 
conventional approaches, we restrict ourselves to the case 
where all the users have the same data rate. All the user- 
specific codes are OVSF with a spreading gain G of 64. 
The long scrambling code is the complex gold code and 
has the frame-length of 38400 chips (10ms). The channel 
is assumed to be a wide sense stationary slowly fading 
with several possible multiple paths to the UE/MS. The 
transmitted signal is also corrupted by additive white 
Gaussian noise (AWGN) during the transmission. The 
simulated SNR range for all simulations is from -10 to 20 
dB. The dominant multi-path delays and attenuation co- 
efficients for the signal received at the UE/MS are 
assumed fixed (i.e., the UE/MS is assumed to be static). 
Further all the multi-path attenuation co-efficients are 
assumed to be complex, i.e., each multipath applies both 
scaling and rotation to the propagated signal. 
For all the included simulations, the received WCDMA 
signal comprises of five multipaths with delays of 0 t 1 } 2, 
3 and 4 chips. The corresponding channel attenuation co- 
efficients are chosen to be, 

h=[0.3l|3£ 0.26|34° 0.2l[31 Q Q.18|37 Q 0.13|34°] 

The multipath channel is estimated online. The UE/MS 
and BS are assumed to be in perfect synchronism. A 
decaying time-adaptive learning rate is chosen for all the 
adaptive algorithms. The B-LMMSE algorithm is applied 
on a block size of G chips; the auto-correlation matrix 
(for the B-LMMSE algorithm) 1 is computed from the 
whole ensemble of the received data, The conjugate of 
the channel filter H> is applied recursively. These steps 
were done to ensure that the performance of the simulated 
B-LMMSE is not restricted due to implementation. We 
provide results for two congestion scenarios of 20 users 
(approx. 30% congestion) and 50 users (approx 80% 
congestion). The final symbol decision stage for all 
algorithms is given by 

P<y») = sign(te{y n }) + sigfl(lm{y n })i (13) 

In the first comparison (Fig. 3a), the channel estimation is 
done for all the multipaths in the signal generation model. 
It is observed that the performance of the proposed 
algorithms RAKE-PCA and RAKE-BSR is very similar 
to B-LMMSE for lower SNR values. Under good SNR 
conditions, the performance of the B-LMMSE algorithm 
is better than the proposed algorithms. 



However, note that for the B-LMMSE the auto- 
correlation matrix is estimated using 5000 symbols, while 
only 1000 instantaneous adaptations (for K = 50) are done 
for the proposed algorithms. In case more adaptations are 
done, the performance of proposed algorithms will 
approach the LMMSE limit, but even the performance 
attained with a limited number of iterations exhibits then- 
effectiveness. 
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Figure J. WCDMA Downlink System for K=20, 50 
users, Performance Comparison with: (a) Perfect 
Channel Estimation, (b) Imperfect Channel Estimate, 
(c) Imperfect Channel with Inter-Cellular Interference 

In the second comparison (Fig* 3b), the channel estimate 
is restricted to the three dominant paths only, which 
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correspond to the choice of delays 0, 1 and 2 in this case. 
In this case it is observed that the detection performance 
of the proposed RAKE-BSR and RAKE-PCA algorithms 
exceeds B-LMMSB at all SNRs. Comparing both RAKE- 
PCA and RAKE-BSR, it is observed that the performance 
of RAKE-PCA is approximately 1% better than RAKE- 
BSR. But RAKE-BSR has other advantages (as discussed 
in section 3.4) of stability and smaller energy of the 
computed filtering matrix W . Another important 
observation in this case is that at very low SNRs, RAKE 
gives the best performance, but as the SNR improves the 
proposed algorithms can achieve very small BER* 
Therefore, an SNR based switching mechanism can be 
developed to switch between the standard RAKE and the 
proposed algorithms, which just constitute an additional 
adaptive stage in the structure of the RAKE receiver. 
In the third scenario (Fig. 3c), both the channel estimate is 
assumed to imprecise as in the previous comparison with 
only three dominant multipaths out of five estimated. In 
addition, the received signal is corrupted by extra-cellular 
signals of the neighboring cell BS. This is a realistic 
scenario in busy metropolitan areas, where there exist 
several dominant multipaths and the cell size is also kept 
relatively small to maximize the number of users per unit 
area. Inter-cellular interference is also critical when the 
UE/MS is on the cell boundary and undergoes a soft (or 
soft-softer) hand-over. The intercellular interference is 
unmodeled MAI and severely limits the performance of 
the detection algorithms. For the purpose of simulation 
this auxiliary BS interference is assumed to have half the 
energy of the primary BS, It is observed that the proposed 
RAKE-B SR/RAKE-PC A algorithms exhibit better 
immunity to this excessive unmodeled MAI and retain 
their qualitative performance advantage over the 
conventional algorithms. 

5. Conclusions 

We have proposed two info-theoretic extensions of the 
standard RAKE detector namely RAKE-B lind Source 
Recovery (RAKE-BSR) and RAKE-Principal Component 
Analysis (RAKE-PCA). These detection schemes add an 
adaptive info-theoretic stage, based on higher-order 
statistics for RAKE-BSR and second-order statistics for 
RAKE-PCA, to the standard RAKE detector. This makes 
the resultant hybrid detector more robust to imperfections 
in channel estimates; unmodeled MAI and other slowly 
varying channel effects etc. Of these proposed algorithms, 
RAKE-BSR is more immune to synchronization errors 
between UE/MS and BS, possesses faster convergence 
and is better capable to cater for various dynamic and 
time-varying channel effects as compared to RAKE-PCA. 
However, for the presented WCDMA results RAKE-PCA 
demonstrates a slight performance advantage. This is due 
to the fact that the simulated channel imperfections are 



only linear, the channel synchronization is perfect and the 
underlying code structure is orthogonal. 
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